Abstract

We analyze the statistical properties of the limit-order book (LOB) shape by rebuilding
the book from ultra-high-frequency data for 23 liquid stocks traded on the Shenzhen Stock
Exchange in 2003. We find that the averaged LOB shape has a maximum away from the same
best price for both buy and sell LOBs. The LOB shape function has an exponential form in
the right tail. The buy LOB is found to be abnormally thicker at the price levels close to
the same best, although there are many more sell orders on the book. We also find that the
LOB shape functions for both the buy and sell sides have periodic peaks with a period of
five. The 1-min averaged volumes at a fixed tick level follow lognormal distributions,
except for the left tails, which display power-law behavior, and exhibit long memory.
Academic implications of our empirical results are also discussed briefly.

1 Introduction



In an order-driven market, the limit-order book (LOB) is a queue of orders waiting
to be executed, and it is the basis of the continuous double auction mechanism. Orders
in the book are sorted according to price-time priority. The construction of the LOB
is a dynamic process. Effective limit orders, whose prices do not penetrate the opposite
best price, are stored in the book, while an effective market order, whose price
penetrates the opposite best, immediately causes a transaction and removes the
corresponding orders from the opposite book. In addition, cancelations also remove
orders from the LOB.



The price levels in the limit-order book are discrete. The difference between two
adjacent price levels is the tick size $u$, which is 0.01 RMB for all stocks in the
Chinese market. The price level $\Delta$ at any given time $t$ can be defined as follows

$$\Delta = \begin{cases} (p_b - p)/u + 1 & \text{for buy orders} \\ (p - p_a)/u + 1 & \text{for sell orders,} \end{cases} \tag{1}$$

where $p$ is an allowed price in the LOB and $p_b$ and $p_a$ are the best bid and best
ask, respectively. According to the definition, $\Delta = 1$ stands for the position at
the best bid (ask) in the buy (sell) LOB. Denote $V_b(\Delta, t)$ (respectively
$V_s(\Delta, t)$) as the volume at level $\Delta$ in the buy (respectively sell) LOB at
event time $t$. $V_b(\Delta, t)$ and $V_s(\Delta, t)$ can be viewed as the instant LOB
shape functions on the buy and sell sides, respectively.
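
As a minimal sketch of the price-level definition in Eq. (1), the relative level of an
order can be computed as below. The function name `price_level` and its argument names
are illustrative assumptions, not part of the original analysis; rounding is added only
to remove floating-point noise, since prices are multiples of the tick.

```python
def price_level(price, best_bid, best_ask, side, tick=0.01):
    """Relative price level of Eq. (1); level 1 is the same best price."""
    if side == "buy":
        distance = best_bid - price
    elif side == "sell":
        distance = price - best_ask
    else:
        raise ValueError("side must be 'buy' or 'sell'")
    # Prices are multiples of the tick, so round away floating-point noise.
    return round(distance / tick) + 1


# A buy order three ticks below the best bid sits at level 4.
level = price_level(9.97, best_bid=10.00, best_ask=10.01, side="buy")
```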



The LOB shape function is of crucial importance in the market microstructure theory
of order-driven markets. A brief discussion is in order. First, the shape of the LOB
affects a trader's strategy and thus influences order aggressiveness [1]. Second, the
LOB shape determines the virtual price impact. The price impact $I(\omega)$ of a virtual
market order of size $\omega$ can be determined as follows [2, 3, 4]

$$I(\omega) = u \times \sup\left\{ n : \sum_{\Delta=1}^{n} V(\Delta, t) < \omega \right\}. \tag{2}$$

It is found that the virtual price impact is much stronger than the actual impact,
and that large price fluctuations are not necessarily caused by large orders but rather
by the liquidity of the book [5, 6]. It is rational for a large trader to split a large
order and to submit the pieces when the opposite LOB is thick, so that the price does not
change much. In contrast, an impatient small trader might submit a small order when the
opposite LOB is thin at small $\Delta$, since such a trader usually has no ensuing orders.
The optimal trading strategy for a large order also depends on the average LOB shape
[7, 8], and could be improved by considering the instant LOB shape function rather than
the average.
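
The virtual price impact of Eq. (2) can be illustrated with a short sketch. The function
name and the list-based book representation are assumptions made for illustration; the
supremum over an empty set is taken to be zero, and an order larger than the whole book
simply returns the depth of the book.

```python
def virtual_price_impact(volumes, omega, tick=0.01):
    """Eq. (2): tick size times the number of leading levels whose cumulative
    volume stays strictly below the order size omega.

    volumes[k] is the volume at level k + 1 of the opposite book.
    """
    cumulative = 0
    n_levels = 0
    for volume in volumes:
        cumulative += volume
        if cumulative >= omega:
            break  # volumes are non-negative, so no later level can qualify
        n_levels += 1
    return n_levels * tick
```

For a book with volumes 100, 200 and 300 at the first three levels, an order of size 250
has a virtual impact of one tick, whereas the same order against a book that is empty at
the first two levels would have an impact of two ticks.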

When these topics are investigated analytically, the LOB shape function is usually
treated as continuous. In the derivation of optimal execution strategies, many
unrealistic LOB shape functions have been proposed [7, 8]. This makes the framework less
useful in practice and calls for a realistic shape function. Indeed, the empirical LOB
shape function has been investigated in different stock markets. Bouchaud et al. found
that the LOB shape of individual liquid stocks on the Paris Bourse (February 2001) is
symmetric between buys and sells and has a maximum away from the current bid (ask) [9].
They also found that the distribution of order sizes at the bid (or ask) can be fitted by
a gamma distribution [9]. Potters and Bouchaud investigated three stocks traded on the
Nasdaq Stock Market and found that all the LOB shape functions are buy/sell symmetric and
that only one stock reaches a maximum before relaxation [10]. Similar results on the
shape function have also been reported using other market data [?, 5, 11].



In this paper, we study in detail the LOB shape of 23 liquid stocks traded on
the Shenzhen Stock Exchange (SZSE) in China. The rest of the paper is organized
as follows. In Section 2 we briefly describe the database we adopt. Section 3
introduces the average shape of the buy and sell LOBs. We then discuss in Section 4 the
probability distributions and temporal dependence of volumes at the first three tick
levels. The last section concludes.



2 Data sets 



The Chinese stock market is a pure order-driven market in which orders are matched,
resulting in transactions. Our data contain ultra-high-frequency data for 23 liquid
stocks listed on the Shenzhen Stock Exchange in 2003 [12]. We find that the results
for different stocks are qualitatively similar. Hence we present the results for one
very liquid stock. In 2003, only limit orders could be submitted, and each trading day
consisted of an opening call auction, a cooling period, and a continuous double auction.
We focus on the LOB in the continuous double auction.

As an example, our presentation is based on the order flow data for Shenzhen
Development Bank Co., LTD (code 000001). The time stamps are accurate to 0.01 second,
and every event is recorded in detail: date, order size, limit price, time, best bid,
best ask, transaction volume, and an aggressiveness identifier (which identifies whether
a record is a buy order, a sell order, or a cancelation). The database records 3,925,832
events in total, including 1,718,156 buy orders, 1,595,961 sell orders, 598,750
cancelations and 12,965 invalid orders. Using this database, we rebuild the LOB according
to the trading rules [13] and study the statistical properties of the LOB shape.

3 Averaged shape



In the continuous double auction mechanism, order placement adds volume to the
book, while order cancelation or transaction removes volume from the book. It
is clear that these three types of events (order placement, order cancelation and
transaction) can change the shape of the LOB. In what follows we use event time,
not clock time: the event time $t$ advances by 1 whenever an event occurs.
At every time $t$, we have an instant LOB shape $V_{b,s}(\Delta, t)$ on each side (buy or
sell). The averaged shape of the buy (sell) LOB can be calculated as follows

$$V_{b,s}(\Delta) = \frac{1}{M} \sum_{t=1}^{M} V_{b,s}(\Delta, t), \tag{3}$$

where $M$ is the total number of events in 2003 for the stock we analyzed.
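
Because $M$ is of the order of millions of events, the average in Eq. (3) is naturally
accumulated one snapshot at a time. The sketch below assumes, purely for illustration,
that each event yields a dictionary mapping the relative level to the volume resting
there; the function name and the data layout are not taken from the original analysis.

```python
import numpy as np


def average_shape(snapshots, n_levels):
    """Event-time average of Eq. (3), accumulated one snapshot at a time.

    snapshots: iterable of dicts {level: volume}, one per event, level >= 1.
    Levels missing from a snapshot count as zero volume.
    """
    total = np.zeros(n_levels)
    n_events = 0
    for book in snapshots:
        for level, volume in book.items():
            if 1 <= level <= n_levels:
                total[level - 1] += volume
        n_events += 1
    if n_events == 0:
        raise ValueError("no events")
    return total / n_events  # entry k is the averaged volume at level k + 1
```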

It is known that traders tend to place their orders at the same best price
[?, ?, 14, 15, 16]. On the other hand, orders placed near the same best have a higher
execution probability, and impatient traders are likely to cancel these orders when they
are not executed immediately. It is thus not clear a priori what LOB shape results from
these opposing forces. Fig. 1 shows the shapes of the buy and sell LOBs.

Fig. 1. LOB shape $V(\Delta)$ as a function of relative distance $\Delta$ for buy and sell
limit-order books in log-linear coordinates (a) and linear-log coordinates (b).

In Fig. 1(a), we find in general that the LOB shape function has a maximum away
from the same best ($\Delta = 1$) and is roughly symmetric about the maximum, which
is consistent with the result of Bouchaud et al. [9]. The LOB shapes are asymmetric
between buy orders and sell orders. The LOB shape $V(\Delta)$ increases for
$\Delta \le \Delta_{\max}$ and decreases afterwards, where $\Delta_{\max} = 4$ for buys
and $\Delta_{\max} = 11$ for sells. We note that only two (000088 and 000539) of the 23
stocks do not have clear maxima and that the values of $\Delta_{\max}$ vary from stock to
stock. In addition, the total volume of sell orders is greater than that of buy orders,
which is especially visible for large $\Delta$. This phenomenon is also observed for the
other stocks, except that two stocks (000088 and 000089) have comparable buy and sell
volumes. It is consistent with the fact that the Chinese stock market in 2003 was in the
middle of a long-lasting bearish antibubble from 2001 to 2005 [17], so that more market
participants tended to sell their shares.



Two more features arise in the empirical LOB shape function. First, although
there are more sell limit orders in the book, the buy LOB is still thicker than the sell
LOB for small $\Delta$ in Fig. 1(a). In 2003, only the information on the first three
visible levels ($\Delta = 1$, $n_2$, and $n_3$, where $n_2$ and $n_3$ are the next two
levels with nonzero instant volume, so that $V(\Delta) = 0$ for all other relative
distances less than $n_3$) was disclosed to traders. We find that 10 stocks have thicker
sell books, 10 stocks have thicker buy books, and the other three have comparable book
thickness. This observation is very interesting, since traders faced a strong illusory
signal that there were more buy orders while the market was bearish. Second, periodic
peaks are present at $\Delta = 5n + 1$ for $n = 0, 1, 2, \ldots$, and are observed
in all 23 stocks. The periodic peaks are higher for sell orders than for buy orders.
The underlying mechanism of this universal behavior is unclear; it might be
related to the trading strategies of large traders or to people's irrational preference
for numbers like 5, 10 or their multiples [18]. These two features call for further
investigation, which is however beyond the scope of this work.



In Fig. 1(b), we show the shape functions in linear-log coordinates to study the
functional form for large $\Delta$. The volumes in both the buy and sell LOBs decrease
exponentially,

$$V_{b,s}(\Delta) \sim e^{-\beta_{b,s} \Delta}. \tag{4}$$

Using the least-squares fitting method, we obtain $\beta_b = 0.044 \pm 0.0004$ for the buy
LOB and $\beta_s = 0.025 \pm 0.0002$ for the sell LOB. The buy LOB decays faster than the
sell LOB, which means that the buy LOB contains a larger proportion of more aggressive
orders than the sell LOB. It seems that buyers pay more attention to the execution
probability, while sellers consider the return on their investment more important. We
notice that most of the other stocks have similar exponentially decreasing shapes. In
contrast, Bouchaud et al. found that the LOB shape tails have power-law behavior for the
three liquid stocks traded on the Paris Bourse [9]. In addition, $V_{b,s}(\Delta)$
abruptly plummets to zero at the tail ends, which is caused by the 10% price fluctuation
limit relative to the closing price on the previous trading day.
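
One way to estimate the decay rate in Eq. (4) is an ordinary least-squares line through
the logarithm of the volume. The text does not state whether the fit was performed on
the logarithm or on the raw volumes, so the sketch below is an assumption about the
procedure, and the function name is hypothetical.

```python
import numpy as np


def fit_exponential_tail(levels, volumes):
    """Fit V ~ A * exp(-beta * level) by least squares on ln V.

    Returns (beta, A). All volumes must be strictly positive.
    """
    levels = np.asarray(levels, dtype=float)
    log_v = np.log(np.asarray(volumes, dtype=float))
    slope, intercept = np.polyfit(levels, log_v, 1)
    return -slope, np.exp(intercept)
```

With the fitted values quoted above, the averaged volume halves roughly every
$\ln 2 / 0.044 \approx 16$ ticks on the buy side and every $\ln 2 / 0.025 \approx 28$
ticks on the sell side.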



We have studied the event-time averaged volume placed at each tick level in the
LOB. However, the volume may fluctuate strongly and deviate greatly from the mean. It is
therefore necessary to analyze the fluctuations of the volume at each tick level. Here,
we study the standard deviation $\sigma$ as a function of the relative distance $\Delta$,
that is,

$$\sigma_{b,s}(\Delta) = \sqrt{\left\langle V_{b,s}(\Delta, t)^2 \right\rangle - \left\langle V_{b,s}(\Delta, t) \right\rangle^2}, \tag{5}$$

where the angle brackets denote the average over event time. The standard deviations for
the buy and sell LOBs are presented in Fig. 2. We find that the functional form of
$\sigma(\Delta)$ is very similar to that of the shape for both buy and sell LOBs. The
standard deviation $\sigma(\Delta)$ increases with $\Delta$ at the first few levels and
then decreases exponentially. When comparing the buy and sell LOBs, the sell LOB is
found to be thicker with larger fluctuations.
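
The standard deviation of Eq. (5) can be evaluated level by level from the first and
second moments. The sketch below assumes, for illustration only, that the instant shapes
are stored as a matrix with one row per event and one column per level.

```python
import numpy as np


def shape_fluctuation(volume_matrix):
    """Eq. (5) per level: sqrt(<V^2> - <V>^2), averages taken over events.

    volume_matrix: array-like of shape (M events, L levels).
    """
    v = np.asarray(volume_matrix, dtype=float)
    mean = v.mean(axis=0)
    mean_sq = (v ** 2).mean(axis=0)
    # Clip tiny negative values caused by floating-point cancellation.
    return np.sqrt(np.clip(mean_sq - mean ** 2, 0.0, None))
```

This is the population standard deviation, so it agrees with `np.std(volume_matrix,
axis=0)` up to rounding.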




4 Statistical properties of volumes at individual tick levels 



4.1 Probability distribution 



We have analyzed the event-time averaged volume above. Here we focus on the volume
averaged over a fixed clock-time interval $\delta t$ at individual levels,

$$v_{b,s}(\Delta, t) = \frac{1}{N} \sum_{i=1}^{N} V_{b,s}(\Delta, t_i), \tag{6}$$

where $t_i$ are the times of the $N$ events that occur in the interval
$(t - \delta t, t]$, and $N$ is a function of $t$ and $\delta t$. We use
$\delta t = 1$ min to calculate the time-averaged volume at each price level.
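
The interval average of Eq. (6) amounts to grouping events by the half-open clock-time
window they fall into. In the sketch below, the function name, the use of seconds as the
time unit, and the choice to skip windows that contain no events (where $N = 0$ leaves
the average undefined) are all assumptions made for illustration.

```python
import math
from collections import defaultdict


def interval_average(times, volumes, dt=60.0):
    """Eq. (6): mean of the instant volume over the events in each (t - dt, t].

    times: event times in seconds; volumes: instant volume at one fixed level.
    Returns {t: average}, where t is the right edge of each non-empty interval.
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    for t_i, v_i in zip(times, volumes):
        k = math.ceil(t_i / dt)  # t_i lies in ((k - 1) * dt, k * dt]
        sums[k] += v_i
        counts[k] += 1
    return {k * dt: sums[k] / counts[k] for k in sorted(sums)}
```

An event that falls exactly on a window boundary, for example at 60 s, is assigned to the
window ending at that time, as required by the half-open interval.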



Fig. 3 shows the probability density functions (PDFs) for $\Delta = 1$, 2, and 3. In
Fig. 3(a), we find that $\ln v$ is in general normally distributed,

$$f(\ln v) = \frac{1}{\sqrt{2\pi}\,\sigma} \exp\left[ -\frac{(\ln v - \mu)^2}{2\sigma^2} \right], \tag{7}$$

that is, $v$ is lognormally distributed with the PDF given by (see footnote 1)

$$p(v) = f(\ln v)/v. \tag{8}$$

This is also different from the Paris Bourse stocks, where the volumes at the best are
distributed according to a Gamma distribution [9]. As the relative distance $\Delta$
increases, the mean $\mu$ of $\ln v$ increases, which is in line with the result in
Fig. 1. We can also expect that $\mu$ decreases for large $\Delta$. More generally, we
find that the 1-min volumes at other tick levels and for different stocks are basically
lognormally distributed.
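
The relation between Eqs. (7) and (8) can be checked numerically: dividing the Gaussian
density of $\ln v$ by $v$ should give a density in $v$ that integrates to one. The
parameter values and function names below are arbitrary illustrative choices, not fitted
values from the data.

```python
import numpy as np


def normal_pdf_of_log(v, mu, sigma):
    """Eq. (7): Gaussian density f evaluated at ln v."""
    z = (np.log(v) - mu) / sigma
    return np.exp(-0.5 * z ** 2) / (np.sqrt(2.0 * np.pi) * sigma)


def lognormal_pdf(v, mu, sigma):
    """Eq. (8): p(v) = f(ln v) / v."""
    return normal_pdf_of_log(v, mu, sigma) / v


def trapezoid(y, x):
    """Trapezoidal rule on a possibly non-uniform grid."""
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))


mu, sigma = 7.0, 1.0  # illustrative values only
v = np.exp(np.linspace(mu - 8 * sigma, mu + 8 * sigma, 20001))
total_probability = trapezoid(lognormal_pdf(v, mu, sigma), v)
```

The extra factor $1/v$ also shifts the peak: $f(\ln v)$ is maximal at $\ln v = \mu$,
whereas $p(v)$ is maximal at $\ln v = \mu - \sigma^2$.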




Fig. 3. Probability density functions $f(\ln v)$ of 1-min averaged logarithmic volumes at
the first three tick levels on the buy LOB in a linear-linear scale (a) and a linear-log
scale (b). The curves corresponding to $\Delta = 2$ and $\Delta = 3$ in (b) have been
vertically translated downward for clarity. The results are similar on the sell side.

When $v$ is small, we find that the empirical curves deviate from the lognormal
distribution $f(\ln v)$. We plot the probability density functions $f(\ln v)$ of $\ln v$
in a linear-log scale in Fig. 3(b). It is clear that the small volumes $v$ deviate from
the corresponding lognormal distributions and exhibit power-law behavior,

$$f(\ln v) \sim v^{\beta_\Delta} \quad \text{or} \quad p(v) \sim v^{\beta_\Delta - 1}. \tag{9}$$

Using least-squares fitting, we obtain $\beta_1 = 4.19 \pm 0.09$
($2.2 < \log_{10} v < 3.5$) for $\Delta = 1$, $\beta_2 = 2.61 \pm 0.03$
($2.1 < \log_{10} v < 4.2$) for $\Delta = 2$, and $\beta_3 = 2.67 \pm 0.05$
($2.1 < \log_{10} v < 4.2$) for $\Delta = 3$.
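
The two forms in Eq. (9) are linked by the change of variables in Eq. (8). Spelled out
with the fitted exponents quoted above, the step reads:

```latex
p(v) = \frac{f(\ln v)}{v} \sim \frac{v^{\beta_\Delta}}{v} = v^{\beta_\Delta - 1},
\qquad
\beta_1 - 1 = 3.19, \quad \beta_2 - 1 = 1.61, \quad \beta_3 - 1 = 1.67 .
```

All three exponents of $p(v)$ are positive, so in the fitted range the density of the
volume itself rises with $v$.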

4.2 Long memory 

Temporal dependence can be quantitatively assessed by the autocorrelation function
$C(\ell)$, which describes the average correlation between two points with time
lag $\ell$. Many processes have an autocorrelation function that decays exponentially
($C(\ell) \sim e^{-\ell/\ell_0}$ for $\ell \to \infty$), which means that these processes
exhibit short memory with a characteristic timescale $\ell_0$. On the other hand, when the
autocorrelation function is not integrable, for example when $C(\ell)$ decays as a power
law ($C(\ell) \sim \ell^{-\gamma}$), the process has long memory without any
characteristic timescale, which means that past values have potential predictive power
for the future.

1 Denote by $g(y)$ and $h(x)$ the PDFs of $y$ and $x$, respectively. If $y$ is a function
of $x$, we have $g(y)\,dy = h(x)\,dx$. With $y = \ln x$, it follows immediately that
$h(x) = g(y)\,dy/dx = g(\ln x)/x$.



Temporal dependence is equivalently characterized by the Hurst index $H$, and the
relationship between the autocorrelation exponent $\gamma$ (assuming
$C(\ell) \sim \ell^{-\gamma}$) and the Hurst index $H$ can be expressed as
$\gamma = 2 - 2H$ [19, 20]. Detrended fluctuation analysis (DFA) is a popular method for
estimating the Hurst index [19, 21, 22]. We perform DFA on the 1-min averaged volumes at
the first three tick levels on the buy LOB. The detrended fluctuation functions $F(\ell)$
are presented in Fig. 4. Clear power-law relations are observed in the three curves,
and the Hurst indexes are $H_1 = 0.76 \pm 0.01$ for $\Delta = 1$, $H_2 = 0.83 \pm 0.01$
for $\Delta = 2$, and $H_3 = 0.81 \pm 0.01$ for $\Delta = 3$. With the Hurst indexes $H$
significantly larger than 0.5, we argue that the 1-min averaged volumes at the first
three tick levels exhibit long memory. Quantitatively similar results are observed
for the sell LOB and for other stocks. This agrees well with the fact that order signs
have long memory [23, 24].
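
For readers unfamiliar with DFA, a minimal sketch of the method is given below. The
detrending order (linear), the use of non-overlapping windows, and the list of window
sizes are assumptions chosen for illustration; the text does not specify the settings
used for Fig. 4, and the function names are hypothetical.

```python
import numpy as np


def dfa_fluctuation(x, scales, order=1):
    """Detrended fluctuation function F(s) for each window size s in scales."""
    x = np.asarray(x, dtype=float)
    profile = np.cumsum(x - x.mean())  # integrated, mean-removed series
    fluct = []
    for s in scales:
        n_seg = len(profile) // s
        # One column per non-overlapping window of length s.
        segments = profile[: n_seg * s].reshape(n_seg, s).T
        t = np.arange(s)
        coeffs = np.polyfit(t, segments, order)    # fits every column at once
        trend = np.vander(t, order + 1) @ coeffs   # local polynomial trends
        fluct.append(np.sqrt(np.mean((segments - trend) ** 2)))
    return np.array(fluct)


def hurst_dfa(x, scales, order=1):
    """Hurst index estimated as the slope of ln F(s) against ln s."""
    f = dfa_fluctuation(x, scales, order)
    slope, _ = np.polyfit(np.log(scales), np.log(f), 1)
    return slope


rng = np.random.default_rng(0)
noise = rng.standard_normal(8192)
scales = [16, 32, 64, 128, 256]
h_noise = hurst_dfa(noise, scales)  # uncorrelated noise should give H near 0.5
```

Under the relation $\gamma = 2 - 2H$, the reported indexes $H_1 = 0.76$, $H_2 = 0.83$
and $H_3 = 0.81$ correspond to autocorrelation exponents of about 0.48, 0.34 and 0.38,
respectively.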






Fig. 4. Plot of the detrended fluctuation functions $F(\ell)$ of 1-min averaged volumes at
the first three tick levels on the buy limit-order book. The results corresponding to
$\Delta = 2$ and $\Delta = 3$ have been vertically translated downwards for clarity.



5 Conclusion 



We have investigated the limit-order book shapes of 23 stocks traded on the Shenzhen
Stock Exchange over the whole year 2003. For brevity, we presented the results
for a very liquid stock (Shenzhen Development Bank Co., LTD, 000001). For most
of the stocks, the averaged shape has a maximum away from the same best and
the volumes in the LOBs decrease exponentially. The LOB shapes are asymmetric
between buy and sell orders, and the sell LOB shape relaxes much more slowly. The
probability density functions of 1-min averaged volumes at the first three tick levels
follow lognormal distributions with power-law behavior for small volumes
in the left tails. Using detrended fluctuation analysis, we confirmed that the 1-min
averaged volumes at a fixed tick level on the LOB exhibit long memory. Compared
with the Paris Bourse stocks, the LOB shapes are qualitatively
similar but quantitatively different.

Several problems arise that need to be addressed: why is the buy LOB abnormally
thicker at the price levels close to the same best, and why are there relatively large
volumes at the tick levels $\Delta = 5n + 1$? It is also noteworthy that our results on
the empirical LOB shape functions can be used to develop more realistic optimal
trading strategies for large traders.